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A method of fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, comprising the steps of: 

receiving a query comprised of one or more fully specified terms and an 
information need, wherein the information need is represented by one or 
more at least partially unspecified terms; 

identifying contexts in the index that contain the one or more fiilly 
specified terms and zero or more at least partially xmspecified terms; and 
locating one or more matches for the information need within the 
identified contexts. 

The method of claim 1, further comprising the step of: 

identifying documents in the index that contain the one or more at least 
partially unspecified terms. 

The method of claim 1, wherein the locating step further comprises: 
converting the query into a finite state machine; and 
matching the finite state machine against the identified contexts. 

The method of claim 3, wherein the finite state machine is a finite state 

transducer. 

The method of claim 3, wherein the finite state machine allows for the appearance 
of fiiUy specified and at least partially unspecified terms in any order in a 
potential matching context. 

The method of claim 3, wherein the finite state machine allows for one or more 
intervening words between the fiiUy specified and at least partially unspecified 
terms in a potential matching context. 

A method of fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts containing the terms, comprising the steps of: 
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receiving a query comprised of one or more fully specified terms and an 
information need and at least a partial restriction on the order that the one 
or more fully specified terms and the information need may appear in a 
potential matching context, wherein the information need is represented by 
one or more at least partially unspecified terms reflecting a category 
restriction; 

identifying contexts in the index that contain the one or more fully 
specified terms and the one or more at least partially imspecified terms in 
the specified order; 

locating one or matches for the information need within the identified 
contexts. 

The method of claim 7, further comprising the step of: 

identifying documents in the index that contain the one or more fully 
specified terms and the one or more at least partially unspecified terms in 
the specified order. 

The method of claim 7, wherein the locating step further comprises: 
converting the query into a finite state machine; and 
matching the finite state machine against the identified contexts. 

The method of claim 9, wherein the finite state machine is a finite state 

transducer. 

The method of claim 9, wherein the finite state machine assigns a dissimilarity 
weight to appearances of fully specified and at least partially unspecified terms in 
potential matching contexts in orders other than that specified in the query. 
The method of claim 9, wherein the finite state machine allows for one or more 
intervening words between the fully specified and at least partially unspecified 
terms in a potential matching context. 
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A method of fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, comprising the steps of: 

receiving a query comprised of one or more fully specified terms and an 

information need, wherein the information need is represented by one or 

more at least partially imspecified terms; 

converting the query into a Boolean expression; 

identifying contexts identifiers satisfying the Boolean expression; and 

locating one or more matches for the information need within the 

identified contexts. 

The method of claim 13, wherein the identifying step further comprises the steps 
of: 

identifying document identifiers satisfying the Boolean expression. 
The method of claims 1, 7 or 13, further comprising the steps of: 
converting the query into a finite state machine; and 
matching the finite state machine against the contents of the contexts 
associated with the identified context identifiers. 

The method of claims 1,7 or 13, wherein the contexts are stored as finite state 
machines. 

The method of claims 1, 7 or 13, wherein the documents are accessible over the 
Intemet. 

The method of claims 1, 7 or 13, wherein the documents comprise World Wide 
Web pages. 

The method of claims 1, 7 or 13, further comprising the step of: 

accumulating information about the one or more matches as they are 
located. 

The method of claim 1, further comprising the step of: 
assigning a score to a match. 
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1 21 . The method of claim 20, wherein the score reflects the number of times an 

2 instance of the match is located among the plurality of documents. 

3 22. The method of claim 1 , further comprising the step of: 

4 outputting one or more of the matches, or a portion thereof, thereby 

5 providing a resuU for the query. 

6 23. The method of claim 22, further comprising the step of: 

7 outputting identifiers or locations of one or more of the documents that 

8 contains a match or portion thereof that was output in the outputting step. 

9 24. The method of claim 23, wherein a location or a document comprises a imiform 

10 resourese locator. 

11 25. The method of claim 23, further comprising the step of: 

^ 12 ranking the documents that contain a match, and wherein the second 

O 13 outputting step comprises outputting the document identifiers or locations 

^ 14 of the documents that contain a match in an order based on the ranking. 

'9 15 26. The method of claim 25, wherein the ranking step comprises ranking a document 
nj 16 based on the number of times a match is located within the document. 

17 27. The method of claim 1 , wherein the category restriction comprises a 

ni 18 morphological feature. 

O 

yi 19 28. The method of claim 1, wherein the category restriction comprises a syntactic 
^ 20 feature. 

21 29. The method of claim 1 , wherein the category restriction comprises a computer 

22 program. 

23 30. The method of claim 1 , further comprising the step of: 

24 storing a match or a portion thereof 

25 31. The method of claim 30, further comprising the step of: 

26 storing a score for the match or portion thereof 

27 32. The method of claim 1 , further comprising the step of: 

28 storing a plurality of matches or portions thereof 

29 33. The method of claim 1, further comprising the step of: 
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Storing a score for a plurality of matches or portions thereof. 
The method of claim 1, wherein the index comprises locations of terms within 
documents. 

The method of claim 34, wherein the locating step further comprises: 

determining the location of a term in the query within a document using 
the index; and 

locating a match for the query based on the location of the term within the 
document. 

The method of claim 1, further comprising the step of: 

ranking a plurality of the located matches or portions thereof. 

The method of claim 36, wherein the ranking step comprises: 

ranking a located match or a portion thereof based on the content of a 
plurality of documents identified in the identifying step. 

The method of claim 36, wherein the ranking step comprises: 

ranking a located match or a portion thereof based on the content of a 
majority of documents identified in the identifying step. 

The method of claim 36, wherein the ranking is based on one or more features 

selected from the list consisting of: the location of a match within a document, a 

weight assigned to a document that contains a match, the age of a document that 

contains a match, the source of a document that contains a match, and a format 

feature of a match within a docimient. 

The method of claim 36, wherein the ranking step comprises: 

ranking a located match or a portion thereof based on the nmnber of times 
an instance of the match is located within a plurality of documents 
identified in the identifying step. 

The method of claim 36, wherein the ranking step comprises: 

ranking a located match or a portion thereof based on the number of times 
an instance of the match is located within a majority of documents 
identified in the identifying step. 
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The method of claims 36, 38 or 39, further comprising the step of: 

outputting one or more of the located matches, or one or more portions 
thereof, in an order based on the ranking, thereby providing a result for the 
query. 

The method of claim 42, further comprising the step of: 

outputting an indication of the ranking of a located match or portion 
thereof 

A method of fulfilling an information need based on documents stored on a 
computer-readable medium comprising the steps of: 

storing an index identifying documents containing terms; 

storing contexts for terms, wherein a context occurs in a document; 

receiving a query containing an unspecified portion; and 

identifying one or more matches for the query within the contexts. 
A method of fulfilling an information need based on documents stored on a 
computer-readable medium comprising the steps of: 

storing an index identifying documents containing terms; 

storing contexts, wherein a context occurs in a document; 

storing information to retrieve a list of contexts for terms; 

receiving a query containing an at least partially unspecified portion; and 

identifying one or more matches for the query within the contexts. 
The method of claim 45, wherein contexts are preanalyzed. 

A method of fulfilling an information need based on documents and an index 
stored on a computer-readable medium comprising the steps of: 

storing contexts for terms, wherein the context occurs in a document; 

storing information identifying a document in which a context occurs; 

receiving a query containing an unspecified portion; and 

identifying a plurality of matches for the query within the contexts.48. 

A method of generating an index for satisfying an information need fi-om a 
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1 48. A method of generating an index for satisfying an information need from a 

2 plurality of document stored on a computer-readable medium, comprising the 

3 steps of: 

4 receiving a document; 

5 identifying a context in the document; 

6 linguistically analyzing the context;selecting a term from the document; 

7 determining if there are more terms in the context to select, and if so, 

8 selecting another term from the context until there are no more 

9 terms to select; 

10 determining if there are more contexts to identify in the document, 

1 1 and if so, identifying another context in the document and repeating the 
^ 12 term selecting step for each term in the context until there are no more 

p 13 contexts to identify; 

Q 

14 determining if there are more documents to receive, and if so, 

15 receiving the next document and repeating the context identifying and 
fij 16 term selecting steps until there are no more documents to receive. 

1^ 1 7 49. The method of claim 48, wherein linguistically analyzing the one or more 

1 8 contexts further comprises identifying category restrictions satisfied by the 

m 19 context. 

20 50. The method of claim 48, further comprising the step of: 

2 1 storing information indicating the number of contexts stored for a term in 

22 a document, a document identifier associated with the document, and 

23 context identifiers indicating the location of the one or more contexts 

24 stored in the context array. 
25 

26 51. The method of claim 48, wherein the analyzed contexts are stored as finite state 

27 machines. 

28 52. The method of claim 48, wherein the analyzed contexts are stored as graphs. 

29 53. The method of claim 48, wherein the analyzed contexts are stored as trees. 
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The method of claim 48, wherein the category restrictions comprise 
morphological features. 

The method of claim 48, wherein the category restrictions comprise syntactic 
features. 

The method of claim 48, wherein the category restrictions comprise computer 
programs. 

An apparatus for fulfilHng an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents and information indicating category 
restrictions that the terms and contexts satisfy, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i)receive a query 
comprised of one or more fully specified terms and an information need, 
wherein the information need is represented by one or more at least 
partially unspecified terms reflecting a category restriction; (ii) identify 
preanalyzed contexts in the index that contain the one or more fully 
specified terms and the one or more at least partially unspecified terms; 
and (iii) locate one or matches for the information need within the 
identified preanalyzed contexts. 
An apparatus for fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts containing the terms, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i) receive a query 
comprised of one or more fully specified terms and an information need 
and at least a partial restriction on the order that the one or more fiiUy 
specified terms and the information need may appear in a potential 
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matching preanalyzed context, wherein the information need is 
represented by one or more at least partially unspecified terms reflecting a 
category restriction; (ii) identify preanalyzed contexts in the index that 
contain the one or more fully specified terms and the one or more at least 
partially unspecified terms in the specified order; and (iii) locate one or 
matches for the information need within the identified preanalyzed 
contexts. 

An apparatus for fulfilling an information need employing an index stored on a 
computer-readable mediimi and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts containing the terms, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i) receive a query 
comprised of one or more fully specified terms and an information need 
and at least a partial restriction on the order that the one or more fully 
specified terms and the information need may appear in a potential 
matching preanalyzed context, wherein the information need is 
represented by one or more at least partially unspecified terms reflecting a 
category restriction; (ii) identify preanalyzed contexts and documents in 
the index that contain the one or more fully specified terms and the one or 
more at least partially unspecified terms in the specified order by 
converting the query into a Boolean expression and identifying context 
identifiers satisfying the Boolean expression; and (iii) locate one or 
matches for the information need within the identified preanalyzed 
contexts by converting the query into a finite state machine and matching 
the finite state machine against the identified preanalyzed contexts. 
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Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to receive a query containing an unspecified portion, the unspecified 

portion including an unspecified term; and 

code to identify one or more matches for the query within a body of 

information stored on a computer-readable medium. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to store contexts for terms, wherein a context occurs in a document, 

code to store information identifying a document in which a context 

occurs; 

code to receive a query containing an unspecified portion; and 

code to identify one or more matches for the query within the contexts. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to identify a plurality of matches for a partially unspecified query; 

and 

code to rank a plurality of the matches or portions thereof. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to identify a plurality of results for a query, the results occurring 

within documents; and 

code to rank the plurality of results based on the content of a plurality of 
documents in which a result is identified. 
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